Q-learning with Censored Data.

نویسندگان

  • Yair Goldberg
  • Michael R Kosorok
چکیده

We develop methodology for a multistage-decision problem with flexible number of stages in which the rewards are survival times that are subject to censoring. We present a novel Q-learning algorithm that is adjusted for censored data and allows a flexible number of stages. We provide finite sample bounds on the generalization error of the policy learned by the algorithm, and show that when the optimal Q-function belongs to the approximation space, the expected survival time for policies obtained by the algorithm converges to that of the optimal policy. We simulate a multistage clinical trial with flexible number of stages and apply the proposed censored-Q-learning algorithm to find individualized treatment regimens. The methodology presented in this paper has implications in the design of personalized medicine trials in cancer and in other life-threatening diseases.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tracking Interval for Doubly Censored Data with Application of Plasma Droplet Spread Samples

Doubly censoring scheme, which includes left as well as right censored observations, is frequently observed in practical studies. In this paper we introduce a new interval say tracking interval for comparing the two rival models when the data are doubly censored. We obtain the asymptotic properties of maximum likelihood estimator under doubly censored data and drive a statistic for testing the ...

متن کامل

Cost-Sensitive Learning for Recurrence Prediction of Breast Cancer

Breast cancer is one of the top cancer-death causes and specifically accounts for 10.4% of all cancer incidences among women. The prediction of breast cancer recurrence has been a challenging research problem for many researchers. Data mining techniques have recently received considerable attention, especially when used for the construction of prognosis models from survival data. However, exist...

متن کامل

Using the SAS * System to Assess Local Influence in Regression Analysis with Censored Data

This paper describes a SAS" macro for the assessing local influence in regression analysis with censored data. The macro allows one to identify those observations that have the greatest local influence on the estimates of the parameters or functions of the parameters of one's model.

متن کامل

A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking

A simple easy to implement algorithm is proposed to address wall tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it solely based on locally sensed data. The proposed method benefits from coupling fuzzy logic and Q-learning to meet requirements of autonomous navigations. Fuzzy if-then rules provide a reliable decision maki...

متن کامل

Thurstonian Boltzmann Machines: Learning from Multiple Inequalities

We introduce Thurstonian Boltzmann Machines (TBM), a unified architecture that can naturally incorporate a wide range of data inputs at the same time. Our motivation rests in the Thurstonian view that many discrete data types can be considered as being generated from a subset of underlying latent continuous variables, and in the observation that each realisation of a discrete type imposes certa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Annals of statistics

دوره 40 1  شماره 

صفحات  -

تاریخ انتشار 2012